Asymptotic results on adaptive false discovery rate controlling procedures based on kernel estimators

Author

  • Pierre Neuvial
Abstract

The False Discovery Rate (FDR) is a commonly used type I error rate in multiple testing problems. It is defined as the expected False Discovery Proportion (FDP), that is, the expected fraction of false positives among rejected hypotheses. When the hypotheses are independent, the Benjamini-Hochberg procedure achieves FDR control at any pre-specified level. By construction, FDR control offers no guarantee in terms of power, or type II error. A number of alternative procedures have been developed, including plug-in procedures that aim at gaining power by incorporating an estimate of the proportion of true null hypotheses. In this paper, we study the asymptotic behavior of a class of plug-in procedures based on kernel estimators of the density of the p-values, as the number m of tested hypotheses grows to infinity. In a setting where the hypotheses tested are independent, we prove that these procedures are asymptotically more powerful in two respects: (i) a tighter asymptotic FDR control for any target FDR level and (ii) a broader range of target levels yielding positive asymptotic power. We also show that this increased asymptotic power comes at the price of slower, non-parametric convergence rates for the FDP. These rates are of the form $m^{-k/(2k+1)}$, where k is determined by the regularity of the density of the p-value distribution, or, equivalently, of the test statistics distribution. These results are applied to one- and two-sided test statistics for Gaussian and Laplace location models, and for the Student model.
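The plug-in idea described in the abstract can be illustrated with a minimal Python sketch. It assumes NumPy is available. The function names, the one-sided rectangular kernel, the `bandwidth` argument, and the choice to estimate the p-value density at 1 are illustrative assumptions, not necessarily the estimators analyzed in the paper.

```python
import numpy as np

def bh_rejections(pvalues, alpha, pi0=1.0):
    """Step-up Benjamini-Hochberg procedure run at level alpha / pi0.

    pi0 = 1.0 gives the standard BH procedure; a plug-in procedure
    passes an estimate of the proportion of true null hypotheses.
    """
    if not 0.0 < pi0 <= 1.0:
        raise ValueError("pi0 must lie in (0, 1]")
    p = np.asarray(pvalues, dtype=float)
    m = p.size
    order = np.argsort(p)
    # Threshold for the i-th smallest p-value: alpha * i / (m * pi0).
    thresholds = alpha * np.arange(1, m + 1) / (m * pi0)
    below = p[order] <= thresholds
    rejected = np.zeros(m, dtype=bool)
    if below.any():
        # Step-up rule: reject everything up to the LAST p-value
        # that falls below its threshold.
        k = np.max(np.nonzero(below)[0])
        rejected[order[:k + 1]] = True
    return rejected

def pi0_rectangular_kernel(pvalues, bandwidth):
    """Hypothetical estimator: one-sided rectangular-kernel estimate of
    the p-value density at 1, capped at 1. Assumes that p-values under
    the null are uniform, so the density near 1 reflects mostly nulls."""
    p = np.asarray(pvalues, dtype=float)
    density_at_one = np.mean(p >= 1.0 - bandwidth) / bandwidth
    return min(1.0, float(density_at_one))
```

With `pi0=1.0` the first function is the plain Benjamini-Hochberg procedure. Passing an estimate below 1 raises every threshold, so the plug-in version rejects at least as many hypotheses, which is the source of the power gain the abstract refers to.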

Similar resources

A New Proof of FDR Control Based on Forward Filtration

For multiple testing problems, Benjamini and Hochberg (1995) proposed the false discovery rate (FDR) as an alternative to the family-wise error rate (FWER). Since then, researchers have provided many proofs to control the FDR under different assumptions. Storey et al. (2004) showed that the rejection threshold of a BH step-up procedure is a stopping time with respect to the reverse filtration g...

Asymptotic Behaviors of Nearest Neighbor Kernel Density Estimator in Left-truncated Data

Kernel density estimators are the basic tools for density estimation in non-parametric statistics. The k-nearest neighbor kernel estimators represent a special form of kernel density estimators, in which the bandwidth is varied depending on the location of the sample points. In this paper, we initially introduce the k-nearest neighbor kernel density estimator in the random left-truncatio...

An Adaptive Step-down Procedure with Proven FDR Control under Independence

In this work we study an adaptive step-down procedure for testing m hypotheses. It stems from the repeated use of the false discovery rate controlling the linear step-up procedure (sometimes called BH), and makes use of the critical constants iq/[m + 1 − i(1 − q)], i = 1, ..., m. Motivated by its success as a model selection procedure, as well as by its asymptotic optimality, we are interested ...
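As a rough illustration of the critical constants quoted above, the following Python sketch applies them in a generic step-down scheme. It assumes NumPy. The function names are hypothetical, and the stopping rule shown is the textbook step-down rule, which the truncated abstract does not spell out.

```python
import numpy as np

def step_down_constants(m, q):
    """Critical constants i*q / (m + 1 - i*(1 - q)) for i = 1, ..., m."""
    i = np.arange(1, m + 1)
    return i * q / (m + 1 - i * (1 - q))

def step_down_rejections(pvalues, q):
    """Generic step-down rule: walk through the sorted p-values from the
    smallest upward and stop at the first one above its constant."""
    p = np.asarray(pvalues, dtype=float)
    m = p.size
    order = np.argsort(p)
    passes = p[order] <= step_down_constants(m, q)
    # Number of leading passes; argmin on booleans finds the first False.
    k = m if passes.all() else int(np.argmin(passes))
    rejected = np.zeros(m, dtype=bool)
    rejected[order[:k]] = True
    return rejected
```

Unlike a step-up rule, this scheme never rejects a hypothesis whose p-value comes after the first failure in the sorted order, even if a later p-value falls below its own constant.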

Asymptotic Properties of False Discovery Rate Controlling Procedures Under Independence

This paper investigates the performance of a family of multiple comparison procedures which have been designed to provide strong control of the False Discovery Rate (FDR). The FDR is the expected False Discovery Proportion (FDP), that is, the expected fraction of false rejections among all rejected hypotheses. Starting from the Benjamini-Hochberg procedure [1], a number of refinements have been...

Generalized estimators for multiple testing: proportion of true nulls and false discovery rate

Two new estimators are proposed: one for the proportion of true null hypotheses and the other for the false discovery rate (FDR) of one-step multiple testing procedures (MTPs). They outperform existing such estimators when applied to discrete p-values whose null distributions dominate the uniform distribution and reduce to leading such estimators when applied to continuous p-values. For the new...


Journal title:
  • Journal of Machine Learning Research

Volume 14

Publication date: 2013